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METHOD AND APPARATUS FOR VIDEO BIT-RATE CONTROL 



BACKGROUND 

This disclosure is related to bit-rate control for video coding 

In general rate, in bit-rate control processes, such as those employed 
for MPEG-2 and MPEG-4, see, for example, "Test Model 5," ISO/IEC 
JTC1/SC29/WG1U 1994 (hereinafter referred to as "TM5");"MPEG-4 
Video Verification Model, Version 15.0," ISO/IEC JTC1/SC29/WG11 
N3093, December 1999 (hereinafter referred to as "Q2"), the bit rate is 
computed based on the bits available and the last encoded frame. If the last 
frame is complex and uses excessive bits, more bits should be assigned to 
the frame to reflect its complexity. However, if there are fewer bits left for 
encoding, fewer bits will actually be assigned. The number of available bits 
or "bit budget" may depend on a number of different considerations, such as 
bandwidth, etc. 



A weighted average reflects a compromise between 
accommodating complexity and meeting the "bit budget." For 
example, Q2, cited previously, models the encoder rate distortion 
function as follows: 



where 
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R is encoding bit count 

S is encoding complexity measured by mean absolute 

difference (MAD) per frame 

Q is quantization parameter; and 

Xj , X2 are the modeling parameters. 

The Q2 approach estimates the modeling parameters (Xi and 
X2) using the least square (LS) method based on previous encoding 
data. Then, the above quadratic equation is solved for Q. this method 
solves the LS equation based on previous encoding data, typically 
Obtained from up to 20 previous frames. So the approach is not only 
complex but also typically employs a large amount of memory. 
Another problem with the approach, however, is that it does not 
always meet the desired bit rate. For example, the desired bit rate is 
not necessarily met for all test images recommended by the MPEG 
standard committee. Furthermore, experimental results show that the 
Q2 approach maintains the desired bit-rate by indiscriminately 
dropping frames. This, of course, may degrade the reconstructed 
video quaUty. A need, therefore, exists for improved bit-rate control 
processes. 



BRIEF DESCRIPTION OF THE DRAWINGS 

Subject matter is particularly pointed out ^d distinctly claimed in 
concluding portion of the specification. The claimed subject matter, 
however, both as to organization and method of operation, together with 
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objects, features, and advantages thereof, may best be understood by 
reference of the following detailed description when read with the 
accompanying drawings in which: 

FIG. 1 is an image produced after being encoded with the Q2 
approach; and 

FIG. 2 is an image produced after being encoded using an 
embodiment of the claimed subject matter. 



DETAILED DESCRIPTION 

In the following detailed description, numerous specific details are 
set forth in order to provide a thorough understanding of the claimed subject 
matter. However, it will be understood by those skilled in the art that the 
claimed subject matter may be practiced without these specific details. In 
other instances, well-known metfiods, procedures, components and circuits 
have not been described in detail in order so as not to obscure the claimed 
subject matter. 

In video image processing, control of the video bit-rate is 
applied to achieve a bit-rate target during encoding. Prediction mode 
decisions, motion vector choices and displaced frame difference 
(DFD) coding fidelity may affect this video bit-rate. Furthermore, 
once the mode and motion vectors are chosen, after motion 
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estimation, quantization parameters (Qp) are applied, which may also 
affect rate control. 

Several approaches for video bit-rate control have been 
proposed. In MPEG-4, reference software is provided by the MPEG 
standard committee referred to as the "Q2 algorithm" or "Q2 
approach" (hereinafter "Q2"). Q2 has been employed at times as a 
benchmark for rate control. Likewise, the TM5 process is used in 
MPEG-2 reference software . These processes have disadvantages, for 
example, in the sense of computational complexity. Furthermore, 
they do not guarantee the desired bit-rate is achieved, and, hence, 
frames may be dropped to achieve the desired bit rate. 

As is well-known, in MPEG "Generic Coding of Moving 
Pictures and Associated Audio Information: Video," ISO/IEC 13818- 
2: International Standard 1995, there are three types of pictures or 
video frames: I, P, and B pictures. I pictures are intra-frame coded 
without reference to any other frames. The P picmres are predictive 
coded using previously reconstructed reference frames. The B pictures 
are usually coded using backward and forward reference frames and 
typically achieve better compression than the other types of pictures. 
Motion vectors are computed by a Sum-of- Absolute-Difference (SAD) 
based block matching scheme. A motion vector MV is represented by 
two components (MVx, MVyX where MVx and MVy are the motion 
vector components in horizontal and vertical directions, respectively. 
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where 

(xo, yo) - Upper left comer coordinates of the 
current macroblock 

C[x, y] - Current macroblock luminance samples 
y] - Reconstructed previous frame luminance 
samples 

S - Search range: ^x,y):-l6 < x, 3; < i 6 } 

It is noted that MPEG is provided merely as one example 
embodiment or application and the claimed subject matter is not 
limited in scope to MPEG or to any other video-related standard. 
However, again, typically in MPEG, the SAD values are computed for 
selected search points in the search space (S) depending upon the 
motion estimation process. The motion vector (MVx, MVy) is selected 
based on the displacement of the search point that results in a 
minimum SAD among the SAD values in the search space. The SADs 
provide information about the macroblocks as well as the video 
frames. This information may be utilized to determine the Qp for the 
video bit-rate control. For I pictures, the SAD with respect to the 
average value of its own block is employed. 

One embodiment of a video bit-rate control process in 
accordance with the claimed subject matter provides adaptive 
quantization for macroblocks while maintaining the bit budget for the 
frame. This embodiment reduces the computational overhead using a 
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pre-calculated SAD-rate relationship depending at least in part on Qp. 
Thus, computational complexity is reduced. Simulation results show 
that this embodiment gives better coding efficiency with better 
reproduced image quality due at least in part to less blocking artifacts 
than the Q2 rate conti-ol approach recommended in MPEG-4 video 
coding. Of course, the claimed subject matter is again not limited to 
the particular embodiment described, as shall become more clear from 
the following description. 

In this embodiment, a quantization (mquant) method is 
applied that calculates tiie appropriate Qp value at a macroblock level 
depending at least in part on the macroblock SAD, while maintaining 
the frame bit budget using a simplified probability distribution. Video 
data changes occur in botii spatial and temporal domains witiiin 
frames. In this embodiment, therefore, adaptive quantization is 
applied below the frame level. Unlike a frame-level quantization 
method, such as, for example, Q2, mquant provides conti:ollability 
mside a frame at a macroblock level, to, therefore, handle the spatially 
active scene. 

Since SAD is generally used for motion estimation, SAD is 
applied here to classify input images. A relationship is estimated 
between the SAD of a macroblock and its associated bit-rate 
depending on the Qp values. In this embodiment, the bit rate is 
calculated for macroblocks by varying Qp within acceptable ranges. 
Then the rate is averaged depending on the index for the quantization 
step sizes. This generates a (SAD, relationship for the 
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quantization parameter, where Rj is the bit-rate. Next, the SAD is 
quantized into a number of bins (no_bin) using the equation below. 
The size of a bin {bin_size) is determined, in this embodiment by the 
total range, depending here on the picture type, divided by the number 
of bins (no_bin), altiiough the claimed subject matter is not limited to 
necessarily employing picture type or to being employed with coding 
that necessarily employs different picture types. 



. . SAD 

index = , rai 

bin _ size 

where 



, . . range 
bin _ size = — 

no bin 



In this example, no_bm=8. So there are 8 classes of 
macroblocks for each I, P and B pictures. Next, the SAD and bit-rate 
relationship is tabulated for the Qp values depending on the picture 
types, again, for this particular embodiment. In the encoder, the index 
for a macroblock is calculated using the equation above and pre- 
calculated SAD, that is, in this embodiment, the SAD obtained from 
motion estimation. After calculating this index, a Qp value is 
determined tiiat meets the given or desired bit-rate for the current 
frame, as explained in more detail hereinafter. 



For a frame, extreme Qp values are employed initially. Using 
these values, the bits for the macroblocks in tiie frame are added using 
the index (quantized SAD) and Qp value from look-up tables. If the 
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added bits are still less than the desired bit rate, this process is 
repeated by decreasing the Qp value until the sum is greater than the 
given budget. 

For this embodiment or example, Qp_max = 20 for I,P 
pictures and 28 for B pictures, and Qp_min = 3 for I,P pictures and 8 
for B pictures. These values are chosen to keep the image quality of 
reference images (I and P-pictures, in this embodiment) and provide a 
higher compression for B-pictures. 

begin 

for (l=Qp_max; 1>= Qp_niin; 
sum_rate = 0; 
for (i=0; i<N; i++){ 

for(j=0;j<M;j++){ 

sum_rate = 
sum_rate+RATE[indexl [1] ; 
} 

} 

if(sum_rate>budget) break; 

} 

Qp = 1; 1"^ your desired Qp for current frame */ 

end 

where 

N: image height divided by 16 
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M: image width divided by 16 ( NxM represents number 
of macroblocks per frame) 

sumjrate: represents the estimated bits spent using the index 
and Qp relationship 

Budget: allocated bits for the current frame. 

Once the Qp value for a frame is determined, for an MPEG-4 
implementation, although, of course, the claimed subject matter is not 
limited in scope in this respect, the Qp value is adjusted macroblock- 
wise. To implement this feature effectively, the macroblocks are 
divided into three groups based on their activity measured by a 
macroblock's SAD, Different Qp values are then applied to the 
macroblocks depending on the groups they belong to; however, the 
overall Qp for the frame should be the one previously determined. A 
potential strategy to implement such an approach is to apply a higher 
Qp to a higher SAD class, as described in more detail below. 
However, alternate strategies may also be applied. 

A strategy in which a higher Qp is applied to a higher SAD 
class, denoted here as "SAD+mq," is designed to maintain the desired 
bit budget. This approach may, therefore, reduce the amount of bits 
generated for a highly active macroblock. On the contrary, a strategy 
in which a lower Qp is applied for a higher SAD class, denoted here 
as "SAD-mq," may maintain the quality of video more successfully. 
This approach may allot more bits to the macroblocks containing 
higher activity. 
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To determine the Qp to assign to a specific macroblock, the 
number of occurrences for each bin, sadjcount, may be counted. 
This permits calculation of threshold values that partition the 
macroblocks in the currently coded frame into three groups, while 
maintaining the desired bit budget. These thresholds divide the 
macroblocks into groups of roughly equal probability to produce a 
cumulative distribution. Therefore, specific Qp values may be 
assigned so that the overall Qp of the frame, as previously determined, 
allows the bit budget to be maintained. 

The number of occurrences for each SAD bin is counted to find 
the previously described thresholds, and, then assign bits for the 
macroblocks. For P and B pictures, the SAD of the macroblocks is 
calculated in MPEG during the motion estimation. Additional 
complexity, however, is present for I pictures. Its SAD is calculated 
because one is not calculated for such pictures during motion 
estimation. Once the rate is determined, the SAD are quantized by 
division. 

This embodiment implemented in C code is as follows: 
begin 

thl = budget /3; 
th2 = 2*thl; 

temp = 0; 

for (i=0; i<N_BIN; i++) { 

temp = temp + sad__count [i] *rate [index] [Qp] ; 
if (temp > thl) { 
tlil= i; 
break; 

} 
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} 

temp = 0; 

for {i=0; i<N_BIN; i++) { 

temp = temp + sad_count [i] * 
rate [index] [Qp] ; 

if (temp > th2) { 
th2= in- 
break; 

} 

} 

k = 0; 

for (i=0; i<vop->height/16; i++) { 

for (j=0; j<vop->width/16; j++){ 
if (sad[i] [j]<=thl) 

qp[k] = qp[k]+l; 
else if (sad[i] [j]>=th2) qp[k] = 

qp[k]-l; 

k++; 

} 

} 

end. 



The Qp value to assign to a specific macroblock is 
accomplished in this embodiment using dquant, which is defined as 
follows: 

dquant = Qp(current MB) -Qp(previous MB) 

For this embodiment, an acceptable range of dquant is ± 2, although, 
again, the claimed subject matter is not hmited in scope in this 
respect. Once a desired Qp for the frame is obtained, the Qp value is 
perturbed ± 1 so that the maximum change will be ±2. However, the 



042390.P11941 



Qp value for the macroblocks is adjusted based on the thresholds 
previously determined. 

In summary, for this embodiment, once the Qp value is 
determined for the frame, the Qp value for specific macroblocks is 
adjusted based at least in part on the SAD for the macroblock. 
However, the Qp value adjustment at the macroblock level is 
conducted so that the overall Qp value for the frame is maintained, 
where one-third are adjusted up and one-third are adjusted down; 
however, which third is adjusted up and which third is adjusted down 
depends on the strategy being pursued as previously discussed. 
Furthermore, the adjustment is conducted so that the acceptable range 
for dquant is maintained, as previously described. 

By contrast, Q2 estimates the modeling parameters using least 
square (LS) method based on previous encoding data, as previously 
described, The following equation (Eq, [5]) is solved: 

Y = AX^X=(A^Ay'A^Y , 
where 

matrix transpose of A 

After obtaining solutions of the above Eq. [5], the quadratic equation 
(Eq. [l])is solved with respect to Q. Solving the LS equation involves 
a matrix inverse calculation based on previous encoding data, up to in 
many cases as much as 20 previous frames. Therefore, computational 
complexity, as well as memory utilization, may be significant. 
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Table 1, therefore, compares the performance between Q2 and 
the previously described embodiment. Experiments were performed 
using 2 video sequences. The experimental results may be 
summarized as follows: 

♦ The embodiment described maintains the desired bit 
budget closely with better compression efficiency while 
the Q2 bit-rate approach may drop frames,, as shown in 
Table 1. 

♦ Also, Q2 often yields over bit-budget compared to the 
desired budget, as shown in Table 1. The previously 
described embodiment yields bit-budgets close to the 
desired one. 

♦ PSNR results of the previously described embodiment 
are comparable to Q2; however, less blocking artifacts 
result from the previously described embodiment, as 
shown in FIGs. 1 & 2. 
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Table 1 



Firebird 

(fast motion) 


Q2 

Rate control 


SAD+mq 


SAD-mq 


Bits/Frame 
(desired bits/frame= 32768) 


34405 


32898 


32922 


Number of Frames dropped 


70 out of 200 
original frames 


NONE 


NONE 




Silent voice 

(moderate motion) 


Q2 

Rate control 


SAD+mq 


SAD-mq 


Bits/Frame 
(desired bits/frame=17476) 


18565 


17396 


17484 


Number of Frames dropped 


0 


0 


0 



The embodiment described, therefore, provides several 
improvements compared to state-of-the-art methodologies, such as 
Q2. Some of these improvements may include: 

♦ Utilizing the SAD computation involved in motion 
estimation. 

♦ Improved adherence to target bit rate. 

♦ Compression efficiency. 

♦ Flexibility, reflected in the alternate quantization step 
size assignment strategies presented 
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♦ Reducing blocking artifacts, as shown in Figures 1 and 
2. 

♦ Computationally efficiency and reduced complexity, 
compared to Q2, in particular. 

♦ Suitability for low-power applications and 
implementations. 

♦ Adaptable to include picture level rate control. 

It will, of course, be understood that, although particular embodiments 
have just been described, the claimed subject matter is not limited in scope 
to a particular embodiment or implementation. For example, one 
embodiment may be in hardware, whereas another embodiment may be in 
software. Likewise, an embodiment may be in firmware, or any 
combination of hardware, software, or firmware, for example. Likewise, 
although the claimed subject matter is not limited in scope in this respect, 
one embodiment may comprise an article, such as a storage medium. Such a 
storage medium, such as, for example, a CD-ROM, or a disk, may have 
stored thereon instructions, which when executed by a system, such as a 
computer system or platform, or a computing system, for example, may 
result in an embodiment of a method in accordance with the claimed subject 
matter being executed, such as an embodiment of a method of implementing 
video bit rate control, for example, as previously described. For example, an 
image processing platform or an image processing system may include an 
image processing unit, an image input/output device and/or memory. 



While certain features of the claimed subject matter have been 
illustrated and described herein, many modifications, substitutions, changes 
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and equivalents will now occur to those skilled in the art. It is, therefore, to 
be understood that the appended claims are intended to cover all such 
modifications and changes as fall within the true spirit of the claimed subject 
matter. 



17 



